Intensional RDB Manifesto: a Unifying NewSQL Model for Flexible Big Data
نویسنده
چکیده
In this paper we present a new family of Intensional RDBs (IRDBs) which extends the traditional RDBs with the Big Data and flexible and ’Open schema’ features, able to preserve the user-defined relational database schemas and all preexisting user’s applications containing the SQL statements for a deployment of such a relational data. The standard RDB data is parsed into an internal vector key/value relation, so that we obtain a column representation of data used in Big Data applications, covering the key/value and column-based Big Data applications as well, into a unifying RDB framework. We define a query rewriting algorithm, based on the GAV Data Integration methods, so that each user-defined SQL query is rewritten into a SQL query over this vector relation, and hence the user-defined standard RDB schema is maintained as an empty global schema for the RDB schema modeling of data and as the SQL interface to stored vector relation. Such an IRDB architecture is adequate for the massive migrations from the existing slow RDBMSs into this new family of fast IRDBMSs by offering a Big Data and new flexible schema features as well.
منابع مشابه
Intensional RDB for Big Data Interoperability
A new family of Intensional RDBs (IRDBs), introduced in [1], extends the traditional RDBs with the Big Data and flexible and ’Open schema’ features, able to preserve the user-defined relational database schemas and all preexisting user’s applications containing the SQL statements for a deployment of such a relational data. The standard RDB data is parsed into an internal vector key/value relati...
متن کاملNewSQL: Towards Next-Generation Scalable RDBMS for Online Transaction Processing (OLTP) for Big Data Management
One of the key advances in resolving the “big-data” problem has been the emergence of an alternative database technology. Today, classic RDBMS are complemented by a rich set of alternative Data Management Systems (DMS) specially designed to handle the volume, variety, velocity and variability of Big Data collections; these DMS include NoSQL, NewSQL and Search-based systems. NewSQL is a class of...
متن کاملEmerging Technologies For Big Data Processing: NOSQL And NEWSQL Data Stores
In this incessant science and technological era, where advances in web technology and the production of mobile devices and sensors connected to the Internet are resulting to voluminous amount of structured, semi-structured and unstructured data, called Big Data, the demand for technologies with extensive processing and storage requirements is rising to persuasively process such data i.e. Big Da...
متن کاملThe SQL++ Unifying Semi-structured Query Language, and an Expressiveness Benchmark of SQL-on-Hadoop, NoSQL and NewSQL Databases
SQL-on-Hadoop, NewSQL and NoSQL databases provide semi-structured data models (typically JSON based) and respective query languages. Lack of formal syntax and semantics, idiomatic (nonSQL) language constructs and large variations in syntax, semantics and actual capabilities pose problems even to database experts: It is hard to understand, compare and use these languages. It is especially tediou...
متن کاملNoSQL Database: New Era of Databases for Big data Analytics - Classification, Characteristics and Comparison
Digital world is growing very fast and become more complex in the volume (terabyte to petabyte), variety (structured and un-structured and hybrid), velocity (high speed in growth) in nature. This refers to as ‘Big Data’ that is a global phenomenon. This is typically considered to be a data collection that has grown so large it can’t be effectively managed or exploited using conventional data ma...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1403.0017 شماره
صفحات -
تاریخ انتشار 2014